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We previously reported the detection of bat coronaviruses (bat CoVs 1A, 1B, HKU7, HKU8 and 
bat-severe acute respiratory syndrome coronavirus) in Miniopterus spp. that cohabit a cave in 
Hong Kong. Here, we report the full genomic sequences of bat CoVs 1A, 1B and HKU8. Bat 
CoVs 1A and 1B, which are commonly found in the Miniopterus, are phylogenetically closely 
related. Using species-specific RT-PCR assays, bat CoVs 1A and 1B were confirmed to have 
distinct host specificities to Miniopterus magnater and Miniopterus pusillus, respectively. 
Interestingly, co-infections of bat CoVs 1B and HKU8 in M. pusillus are detected in seven of 38 
virus-positive specimens collected from 2004 to 2006. These findings highlight that co-infections 
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of some coronaviruses might be common events in nature. The biological basis for the host 
restriction of bat coronaviruses, however, is yet to be determined. 


Coronavirus (CoV) is a genus of viruses in the family 
Coronaviridae under the order Nidovirales. Based on 
antigenic and genetic analyses, CoVs are divided into three 
major groups. The viruses have positive-sense, single- 
stranded RNA genomes with sizes ranging from 27000 to 
32000 nt (Gorbalenya et al, 2006). Their genomes are 
polycistronic and contain at least five major open reading 
frames (ORFs) with the characteristic gene order [5’- 
replicase (rep), spike (S), envelope (E), membrane (M) and 
nucleocapsid (N)-3']. The severe acute respiratory syn- 
drome (SARS) outbreak in 2003 highlighted the potential 
health threat of coronaviruses to humans (Peiris et al., 
2003). This prompted an intensive search for the precursor 
of SARS CoV in wildlife and the discovery of several novel 
CoVs in bats in China, including the viruses that are closely 
related to SARS CoV (Dong et al., 2007; Lau et al., 2005; 
Poon et al., 2005; Tang et al., 2006; Woo et al., 2006). 
Recently, the biological importance of bats for the ecology 
of coronaviruses was further reiterated by the discovery of 
novel bat CoVs (BtCoVs) in other continents (Dominguez 
et al., 2007; Muller et al., 2007). 


We previously reported the discovery of the first BtCoV 
(Poon et al., 2005). In addition, we also performed a 
longitudinal study on Miniopterus magnater and 
Miniopterus pusillus bats that cohabit an abandoned cave 


The GenBank/EMBL/DDBJ accession numbers for the sequences 
reported in this paper are EU420137-EU420139. 


Supplementary material is available with the online version of this paper. 


in Hong Kong (Chu et al., 2006). BtCoV HKU7 and bat- 
SARS-CoV were detected only once in our specimens, 
while BtCoVs 1A, 1B and HKU8 were detected repeatedly 
in these bats. A clear host species restriction was observed 
with BtCoV 1A being found in M. magnater and BtCoV 1B 
in M. pusillus even though these two CoVs are genetically 
very similar and the two bat species co-inhabit the same 
cave. Thus, these virus surveillance studies might allow 
unique opportunities to understand better the dynamics 
and prevalence of coronaviruses within a single geograph- 
ical location. To understand better the genetic relationships 
of the BtCoVs that are repeatedly found in the local 
Miniopterus spp, representative faecal samples positive for 
BtCoVs 1A, 1B and HKU8 in our previous investigations 
were selected for this study. Genomic sequences of these 
three viruses were determined and analysed as described in 
Supplementary Material (available in JGV Online). 


The genomes of BtCoVs 1A, 1B and HKU8, excluding the 
3' poly(A) tails, have sizes of 28309, 28460 and 28757 nt, 
respectively (GenBank accession nos EU420137- 
EU420139). The G+C content of 1A, 1B and HKU8 is 
38.2, 38.6 and 41.8 %, respectively. The genome organiza- 
tions of these CoVs are typical of group 1 CoVs (Table 1) 
and the amino acid sequence identities of the major viral 
proteins between these viruses are shown in Supplementary 
Table S1 (available in JGV Online). The 3’ UTRs and N 
genes of these viruses were reported previously and will not 
be discussed further (Chu et al., 2006). Interestingly, HKU8 
was found to contain an extra ORF at its 3’ end. This 
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Table 1. Predicted ORFs in the genomes of BtCoV 1A (strain AFCD62), BtCoV 1B (strain AFCD307) and BtCoV HKU8 (strain 


AFCD77) 
The nucleotide positions for nsp 1-16 are also shown in the table. Na, Not applicable 
BtCoV 1A BtCoV 1B BtCoV HKU8& 
Position Length (nt) Position Length (nt) Position Length (nt) 
ORFla 272-13078 12804 273-13247 12972 270-12965 12693 
ORF1b 13048-21069 8019 13217-21238 8019 12935-20959 8022 
nsp1 (ppla/pplab 272-859 588 273-860 588 270-323 54 
nsp2 (ppla/pplab 860-2950 2091 861-2948 2088 324-1811 1488 
nsp3 (ppla/pplab 2951-8290 5340 2949-8459 5511 1812-8174 6363 
nsp4 (ppla/pplab 8291-9712 1422 8460-9881 1422 8175-9599 1425 
nsp5 (ppla/pplab 9713-10618 906 9882-0787 906 9600-10505 906 
nsp6 (ppla/pplab 10619-11455 837 10788-11624 837 10506-11342 837 
nsp7 (ppla/pplab 11456-11704 249 11625-11873 249 11343-11591 249 
nsp8 (ppla/pplab 11705-12286 582 11874-12455 582 11592-12173 582 
nsp9 (ppla/pplab 12287-12616 330 12456-12785 330 12174-12503 330 
nsp10 (ppla/ 12617-13024 408 12786-13193 408 12504-12911 408 
pplab) 
nsp11 (ppla) 13025-13078 54 13194-13247 54 12912-12965 54 
nsp12 (pplab) 13025-15804 2781 13194-15973 2781 12912-15691 2781 
nsp13 (pplab) 15805-17595 1791 15974-17764 1791 15692-17482 1791 
nsp14 (pplab) 17596-19149 1554 17765-19318 1554 17483-19039 1557 
nsp15 (pplab) 19150-20166 1017 19319-20335 1017 19040-20056 1017 
nsp16 (pplab) 20167-21066 900 20336-21235 900 20057-20956 900 
S 21071-25198 4125 21240-25397 4155 20956-25083 4125 
ORF3 25198-25857 657 25366-26025 657 25083-25751 666 
E 25851-26075 222 26019-26243 222 25835-25959 222 
M 26082-26840 756 26250-27017 765 25966-26721 753 
N 26861-28030 1167 27038-28189 1149 26770-28038 1266 
OREF7 NA = << 27788-28534 744 


predicted ORF (designated ORF 7) overlaps with the N 
gene and encodes a putative protein with 248 aa residues 
(Table 1). 


The size of the 5’ UTR of 1A, 1B and HKU8 is 271, 272 and 
269 nt, respectively. Three stem-loop structures that were 
proposed to be conserved in group 1 CoVs (Liu et al. 
2007) and a core sequence of leader transcription 
regulatory sequence (5'-CUAAAC-3’) were identified in 
these 5’ UTR sequences (Supplementary Fig. S1 available in 
JGV Online). 


These ORFs 1 occupy approximately 70% of the genomes. 
ORF1 consists of ORFla and ORF1b that encodes viral ppla 
and pplab, respectively. Putative features responsible for the 
—1 ribosomal frame shifting, e.g. the ‘slippage sequence’ (5’- 
UUUAAAC-3’), were identified in the genomes (data not 
shown). Based on in silico analysis, the ppla and pplab 
proteins are predicted to be cleaved into a total of 16 non- 
structural proteins by virus proteases (Table 1) (Chen et al., 
2003). The predicted 3CL protease cleavage sites in these 
polyproteins (Supplementary Table S2 available in JGV 
Online) agree with previously defined substrate specificity 
for other 3CL proteases (Ziebuhr et al., 2000). All pplab of 


BtCoV 1A, 1B and HKU8 contain functional units that are 
typical for CoVs. For example, all these viruses contain two 
papain-like (PL) protease domains in nsp3, a 3C-like (3CL) 
protease in nsp5, CoV RNA-dependent RNA polymerase 
(RdRp) in nsp12, RNA helicase (Hel) in nsp13 and an 
mRNA cap-1 methyltransferase (MT) functional domain in 
nsp16. It was noted that the putative nsp1 of HKU8 has a 
size of 18 aa, which is significantly smaller than sizes of nsp1 
predicted in other group 1 CoVs. 


Similar to other CoVs, the N-terminal regions of these S 
genes are genetically diverse. The protein sequence 
identities between the putative S1 domains (the first 740 
residues approximately) of HKU8 to 1A, HKU8 to 1B, and 
1A to 1B are 38.5, 38.7 and 86.6%, respectively. The 
corresponding sequence identities for the putative S2 
domains are 56.2 (HKU8 to 1A), 55.9 (HKU8 to 1B) and 
91.3% (1A to 1B), respectively. Twenty nine, 31 and 30 
potential N-glycosylation sites were predicted in 1A, 1B 
and HKU8 S proteins, respectively. Two putative heptad 
repeat (HR) regions, which are conserved in all CoVs 
(Bosch et al., 2003), were identified in these S proteins 
(both 1A and 1B: HR1, aa 973-1122; HR2, aa 1252-1311. 
HKU8: HRI, aa 976-1121; HR2, aa 1261-1317). 
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ORF3 of 1A and 1B have the same size of 660 nt and that of 
HKU8 is 669 nt. In terms of amino acid sequence 
similarity, ORF3 is the least conserved gene amongst these 
three viruses (Supplementary Table S3 available in JGV 
Online). Signal anchors in these ORF3 proteins were 
predicted with the probability of 0.545—-0.689 by SignalP. 
Multiple transmembrane motifs were also predicted in 
these ORF3 proteins, suggesting these proteins might be 
surface proteins. 


The E genes of these viruses are both the same size of 
225 nt. One transmembrane signal was predicted in each of 
these E proteins (P>0.990), with the N terminus exposed 
to the surface. 


Signal peptides were predicted in these M proteins 
(P>0.933). Cleavage sites were located between aa 17 and 
18 in these M proteins. The N-terminal signal peptide was 
not observed in other BtCoV M proteins, including BtCoV/ 
512/2005 and HKU2. Three strong transmembrane helices 
and one marginal transmembrane helix were predicted 
with the N terminus exposed to the surface. It has been 
reported that the M protein of the transmissible gastro- 
enteritis virus (TGEV) adopts two different topologies in 
the virus envelope with three transmembrane helices and 
the C terminus located inside the virus particle, or with 
four transmembrane helices and the C terminus exposed to 
the virion surface (Escors et al., 2001). This may also be the 
case for M proteins of these group 1 BtCoVs. 


ORF7 is present only in HKU8, and it is located 
downstream of the N protein. A BLAST search of the 
ORF7 amino acid sequence revealed no homologous gene 
in GenBank. Prediction of an ORF (or ORFs) downstream 
of the N gene was also reported in the genomes of some 
group 1 CoVs, including TGEV (GenBank accession no. 
NC_002306), feline infectious peritonitis virus (FIPV) 
(GenBank accession no. AY994055), BtCoV/512/2005 
(Tang et al., 2006) and BtCoV HKU2 (GenBank accession 
no. NC_009988). ORF7 in TGEV was shown to produce an 
endoplasmic reticulum associated protein, suggesting it 
plays a role in the association of replication complexes or 
in virion assembly (Tung et al., 1992). The ORF7ab of 
FIPV has been suggested to affect virulence (Haijema 
et al., 2004). However, there is no significant sequence 
similarity between these ORFs and it is likely that these 
viruses have acquired their unique ORFs from different 
sources. 


The putative core sequence 5’-CUAAAC-3’ in the 
transcription regulatory sequence (TRS), which was 
identified in other group 1 CoVs (Dye & Siddell, 2005; 
Hiscox et al., 1995; Lau et al., 2007; Pyrc et al., 2004), is 
present in the leader TRSs of the studied genomes. They 
start at position 67 in BtCoV 1A and position 68 in both 
BtCoV 1B and HKU8. All the core TRS sequences are 
highly similar to its corresponding leader TRS 
(Supplementary Fig. $2) except the TRSs for all ORF3 
and the E gene of HKU8. It has been reported that the 
leader TRS can also join the body segment at an unusual 


consensus sequence preceding the ORFs without the 
presence of the core sequence (Hofmann et al., 1993). 
However, we were not able to demonstrate this in these bat 
viruses because none of these viruses could be cultured in 
vitro and we have no means of obtaining sufficient viral 
RNA for detailed investigation. Thus, the authentic TRSs of 
these bat viruses, in particular those for ORF3, are yet to be 
confirmed. 


Phylogenetic analyses of RdRp, helicase, S$, E, M and N 
proteins from these BtCoVs were performed. Mouse 
hepatitis virus was included as an out-group in the 
analyses. In all cases, the three viruses studied formed a 
clade that is distinct to group 1 members isolated from 
other animals (Fig. 1). The BtCoVs 1A, 1B and HKU8 
viruses phylogenetically cluster with BtCoVs Shandong/ 
977/2006 and Fujian/773/2005 detected in mainland China 
(Fig. 1). Sequence analysis on partial RdRp sequences 
revealed that the Fujian/773/2005 found in Miniopterus 
schreibersii could be a HKU7-like virus (not shown). The 
previously published S gene and helicase gene sequences of 
BtCoV Shandong/977/2006 (Vijaykrishna et al., 2007), 
which was detected in bats of the species Rhinolophus 
ferrumequinum, show a close genetic relationship to the 
three CoVs described in this report. The S protein amino 
acid sequence similarity between BtCoV Shandong/977/ 
2006 and BtCoV 1A is 46.5 %, while BtCoV 1A to HKU8 is 
46.4 % (Supplementary Table S3). A similar relationship was 
also observed for the helicase gene (Shandong/977/2006 to 
1A: 74.2%; Shandong/977/2006 to HKU8: 75.5%; 1A to 
HKU8: 76.5 %). Therefore, CoVs related to that found in the 
genus Miniopterus could also be found in other genera of 
bats. Considering the relationship between these CoV S 
proteins, which is one of the major determinants of host 
species specificity of infection (Dveksler et al., 1991), and the 
relatively low detection rates of BtCoVs HKU8 and HKU7 in 
Miniopterus (Chu et al., 2006), it is possible that the two 
viruses are naturally harboured in species of bats other than 
Miniopterus bats. Overall, the above findings highlight that 
some BtCoVs might have the ability for interspecies 
transmission, an event that is relevant to the genesis of 
SARS CoVs in humans. 


It has been previously shown that recombination is not a 
rare event in CoVs (Makino et al., 1986). To detect possible 
recombination between the bat viruses we studied, which 
were all detected in Miniopterus bats found at a single 
habitat, genome sequences of BtCoVs 1A, 1B and HKU8 
were aligned and examined by SimPlot and BootScan 
analyses. In addition, these genomes were also aligned with 
other full-length group 1 BtCoV genomes, BtCoV/512/ 
2005 and BtCoV HKU2, for comparison. Bat-SARS-CoV 
was included in the analyses as this virus had been detected 
once in our study (Chu et al., 2006). No evidence of 
recombination between these genomes was detected (data 
not shown). 


The limitation of direct sequencing of PCR amplicons is 
that the presence of co-infections may be masked by the 
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Fig. 1. Phylogenetic analyses of RdRp, Hel, S, E, M and N protein amino acid sequences. S protein of BtCoV HKU2 was not 
included in the analysis as the sequence similarity of this protein to that of all other group 1 CoVs was extremely low (Lau et ai., 
2007). MHV was included as an out-group in the analyses. Numbers at nodes indicate bootstrap values as a percentage. Bars 
show the estimated genetic distance of these viruses. GenBank accession numbers of the reference sequences and definition 
of abbreviations are listed in Supplementary Material (available in JGV Online). 
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sequence of the dominant virus. Given the high detection 
rates of BtCoVs 1A and 1B in our specimens and by the 
apparent species restriction of these two viruses to two 
different bat species that co-inhabit the same cave, we 
designed species-specific PCR assays for detecting BtCoVs 
1A, 1B and HKU8 in our specimens. We applied these 
assays to all specimens that were positive in our CoV 
consensus screening RT-PCR in order to detect the 
presence of co-infections. These viruses were selected for 
screening as they were repeatedly detected in Miniopterus 
bats found within the same cave in Hong Kong. A total of 
38 specimens collected in a period from 2004 to 2006 in 
Hong Kong, which were known to be positive for CoV 
(Chu et al., 2006), were tested by six different species- 
specific RT-PCR assays (RdRp and N genes of BtCoV 1A, 
1B and HKU8). The identities of the amplified products 
were all confirmed by sequencing. In summary, infection 
with 1A alone was detected in 16 specimens collected from 
M. magnater, infection with 1B alone was detected in 14 
specimens collected from M. pusillus and infection with 
BtCoV HKU8 alone was detected in one specimen collected 
from M. magnater. There was no evidence of co-infections 
with the more genetically closely related viruses 1A and 1B, 
even though these two viruses co-habited the same cave 
and were more frequently detected in our specimens from 
their corresponding hosts. These results suggest that there 
is a strong host restriction for these two viruses that 
apparently hinders interspecies transmission between these 
two species of bat. Interestingly, co-infections of 1B and 
HKU8 were detected in seven M. pusillus specimens 
collected in 2004 (n=2) and 2006 (n=5) (Table 2). 


The partial N gene sequences (>500 nt) derived from the 
co-infected samples were aligned and a phylogenetic tree 
was constructed (Supplementary Fig. $3 available in JGV 
Online). The analysis revealed that the N gene sequences of 
the 1B viruses were not identical (similarities ranging from 
96.8 to 98.8 %), confirming that our findings were not due 
to cross-contamination. On the other hand, three different 
HKU8 N genes sequences were detected (similarities 


ranging from 96.0 to 98.1%) (Supplementary Fig. S3 
available in JGV Online). HKU8 N gene sequences deduced 
from four of five specimens collected in the same sampling 
trip in 2006 were found to be identical. The detection of a 
genetically identical virus in these four specimens could 
represent a recent spread of a single infecting strain within 
this bat population, but the alternative possibility of cross- 
contamination at sampling or subsequent procedures 
cannot be conclusively excluded. However, the finding of 
HKU8 and 1B co-infections in samples AFCD337 and 
WCF4 with unique sequences in each of these specimens 
(Table 2) and the fact that such co-infections have been 
detected in multiple sampling occasions (2004 and 2006) 
strongly suggest that co-infections of these two viruses in 
M. pusillus occur in nature and are not isolated events. 


In this report, we describe the full genomes of three group 
1 BtCoVs. Co-infection with different CoVs in their host 
provides chances for producing recombinants between 
these CoVs. In our study, there was no evidence of 
recombination events detected between the reported 
genomes of BtCoV 1A, 1B and HKU8. But the diversity 
of CoVs found in bats and the detection of BtCoV co- 
infections suggest bats might play a key role in shaping the 
evolution and ecology of CoV. The availability of genetic 
information for these viruses helps us better understand 
the taxonomy and evolution of CoVs in general. But more 
importantly, since SARS and a number of other new 
human diseases have emerged through interspecies trans- 
mission of viruses harboured by bats, the biological basis of 
host restriction of bat CoVs remains an important subject 
for study for global public health. 
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